TRIVIAL INTERSECTION OF σ-FIELDS AND GIBBS SAMPLING

By Patrizia Berti, Luca Pratelli and Pietro Rigo

Università di Modena e Reggio-Emilia, Accademia Navale di Livorno
and Università di Pavia

Let $(\Omega,\mathcal{F},P)$ be a probability space and $\mathcal{N}$ the class of those
$F\in\mathcal{F}$ satisfying $P(F)\in\{0,1\}$. For each $\mathcal{G}\subset\mathcal{F}$, define
$\overline{\mathcal{G}}=\sigma(\mathcal{G}\cup\mathcal{N})$. Necessary and sufficient conditions for
$\overline{\mathcal{A}}\cap\overline{\mathcal{B}}=\overline{\mathcal{A}\cap\mathcal{B}}$, where
$\mathcal{A},\mathcal{B}\subset\mathcal{F}$ are sub-σ-fields, are given. These conditions are then
applied to the (two-component) Gibbs sampler. Suppose X and Y are the coordinate projections on
$(\Omega,\mathcal{F})=(\mathcal{X}\times\mathcal{Y},\mathcal{U}\otimes\mathcal{V})$ where
$(\mathcal{X},\mathcal{U})$ and $(\mathcal{Y},\mathcal{V})$ are measurable spaces. Let
$(X_n,Y_n)_{n\ge 0}$ be the Gibbs chain for P. Then, the SLLN holds for $(X_n,Y_n)$ if and only if
$\overline{\sigma(X)}\cap\overline{\sigma(Y)}=\mathcal{N}$, or equivalently if and only if
$P(X\in U)\,P(Y\in V)=0$ whenever $U\in\mathcal{U}$, $V\in\mathcal{V}$ and
$P(U\times V)=P(U^c\times V^c)=0$. The latter condition is also equivalent to ergodicity of
$(X_n,Y_n)$, on a certain subset $S_0\subset\Omega$, in case $\mathcal{F}=\mathcal{U}\otimes\mathcal{V}$
is countably generated and P is absolutely continuous with respect to a product measure.



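1. The problem. Let $(\Omega,\mathcal{F},P)$ be a probability space and
$\mathcal{A},\mathcal{B}\subset\mathcal{F}$ sub-σ-fields. Letting

$\mathcal{N}=\{F\in\mathcal{F}:P(F)\in\{0,1\}\}$ and $\overline{\mathcal{G}}=\sigma(\mathcal{G}\cup\mathcal{N})$,

for any subclass $\mathcal{G}\subset\mathcal{F}$, we aim at giving conditions for

(1) $\overline{\mathcal{A}}\cap\overline{\mathcal{B}}=\overline{\mathcal{A}\cap\mathcal{B}}$.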

2. Motivations. Apart from its possible theoretical interest, there are 
three (nonindependent) reasons for investigating (1). 

2.1. Iterated conditional expectations. Given a real random variable Z
satisfying $E\{|Z|\log(1+|Z|)\}<\infty$, define $Z_0=Z$ and $\mathcal{G}_n=\mathcal{A}$ or
$\mathcal{G}_n=\mathcal{B}$ as n is even or odd. By a classical result of Burkholder and Chow [2] and
Burkholder [4], one obtains

(2) $Z_n:=E(Z_{n-1}\mid\mathcal{G}_n)\to E(Z\mid\overline{\mathcal{A}}\cap\overline{\mathcal{B}})$ a.s.

A natural question is whether $E(Z\mid\mathcal{A}\cap\mathcal{B})$ can be taken as the limit in (2).

Corollary 2.1. $Z_n\to E(Z\mid\mathcal{A}\cap\mathcal{B})$ a.s., for all real random variables
Z such that $E\{|Z|\log(1+|Z|)\}<\infty$, if and only if condition (1) holds.

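Proof. Under (1), just note that $E(Z\mid\mathcal{A}\cap\mathcal{B})$ is a version of
$E(Z\mid\overline{\mathcal{A}}\cap\overline{\mathcal{B}})$. Conversely, suppose
$Z_n\to E(Z\mid\mathcal{A}\cap\mathcal{B})$ a.s. for all Z. Since
$\overline{\mathcal{A}\cap\mathcal{B}}\subset\overline{\mathcal{A}}\cap\overline{\mathcal{B}}$, it
suffices to prove that
$\overline{\mathcal{A}}\cap\overline{\mathcal{B}}\subset\overline{\mathcal{A}\cap\mathcal{B}}$. Given
$F\in\overline{\mathcal{A}}\cap\overline{\mathcal{B}}$, condition (2) implies
$I_F=E(I_F\mid\overline{\mathcal{A}}\cap\overline{\mathcal{B}})=E(I_F\mid\mathcal{A}\cap\mathcal{B})$ a.s.
Letting $F_0=\{E(I_F\mid\mathcal{A}\cap\mathcal{B})\neq I_F\}$, and noting that $P(F_0)=0$, yields

$F=(F\cap F_0)\cup(\{E(I_F\mid\mathcal{A}\cap\mathcal{B})=1\}\cap F_0^c)\in\overline{\mathcal{A}\cap\mathcal{B}}$. □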

As an application, think of a problem where $E(\cdot\mid\mathcal{A})$ and $E(\cdot\mid\mathcal{B})$
are easy to evaluate while $E(\cdot\mid\mathcal{A}\cap\mathcal{B})$ is not. In order to estimate
$E(Z\mid\mathcal{A}\cap\mathcal{B})$, one strategy is using condition (2), but this is possible
precisely when (1) holds.
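The alternating scheme in (2) can be tried numerically on a finite product space. The following is only a minimal sketch under assumptions that are not in the text: $\Omega$ is a small grid, P is given by a matrix `p` of cell probabilities, $\mathcal{A}=\sigma(X)$ (rows), $\mathcal{B}=\sigma(Y)$ (columns), and all function names are hypothetical. With the first matrix the iterates approach the constant $E(Z)=0.2$, which is $E(Z\mid\mathcal{A}\cap\mathcal{B})$ since $\sigma(X)\cap\sigma(Y)$ is trivial. With the second matrix, where X = Y a.s., they stay equal to the indicator of {X = 0} on the support of P, so the limit in (2) is not $E(Z\mid\mathcal{A}\cap\mathcal{B})$.

```python
def cond_exp_given_x(p, z):
    """E(z | sigma(X)): replace each row of z by its p-weighted average."""
    out = []
    for p_row, z_row in zip(p, z):
        mass = sum(p_row)
        # Rows of zero mass are P-null; any value works there, 0.0 is used.
        avg = sum(w * v for w, v in zip(p_row, z_row)) / mass if mass > 0 else 0.0
        out.append([avg] * len(z_row))
    return out


def transpose(m):
    return [list(col) for col in zip(*m)]


def cond_exp_given_y(p, z):
    """E(z | sigma(Y)): the same averaging applied to columns."""
    return transpose(cond_exp_given_x(transpose(p), transpose(z)))


def alternate(p, z, steps):
    """Iterate Z_n = E(Z_{n-1} | G_n), with G_n = sigma(X) for even n, sigma(Y) for odd n."""
    for n in range(1, steps + 1):
        z = cond_exp_given_x(p, z) if n % 2 == 0 else cond_exp_given_y(p, z)
    return z


z = [[1.0, 0.0], [0.0, 0.0]]              # Z = indicator of the cell (0, 0)
connected = [[0.2, 0.1], [0.0, 0.7]]      # support links both rows and both columns
disconnected = [[0.5, 0.0], [0.0, 0.5]]   # X = Y a.s.

lim_connected = alternate(connected, z, 200)
lim_disconnected = alternate(disconnected, z, 200)
```

Exact averaging is possible here only because the space is finite; in general the two conditional expectations would have to be available in closed form or approximated.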

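2.2. Sufficiency. Suppose that, rather than a single probability measure
P, we are given a collection $\mathcal{M}$ of probability measures Q on $(\Omega,\mathcal{F})$. For any
$\mathcal{G}\subset\mathcal{F}$ define $\mathcal{G}_{\mathcal{M}}=\sigma(\mathcal{G}\cup\mathcal{N}_{\mathcal{M}})$,
where $\mathcal{N}_{\mathcal{M}}$ is the class of those $F\in\mathcal{F}$ such that $Q(F)=0$ for all
$Q\in\mathcal{M}$ or $Q(F)=1$ for all $Q\in\mathcal{M}$. In this framework, condition (1) turns into

(1*) $\mathcal{A}_{\mathcal{M}}\cap\mathcal{B}_{\mathcal{M}}=(\mathcal{A}\cap\mathcal{B})_{\mathcal{M}}$.

A sub-σ-field $\mathcal{G}\subset\mathcal{F}$ is sufficient (for $\mathcal{M}$) in case, for each
$F\in\mathcal{F}$, there is a $\mathcal{G}$-measurable function $f:\Omega\to\mathbb{R}$ which is a
version of $E_Q(I_F\mid\mathcal{G})$ for all $Q\in\mathcal{M}$.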

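Generally, sufficiency of both $\mathcal{A}$ and $\mathcal{B}$ does not imply that of
$\mathcal{A}\cap\mathcal{B}$. By Theorem 4 of [3], however, $\mathcal{A}\cap\mathcal{B}$ is sufficient
provided $\mathcal{A}$ and $\mathcal{B}$ are sufficient and at least one of them includes
$\mathcal{N}_{\mathcal{M}}$. This implies the following result.

Corollary 2.2. $\mathcal{A}\cap\mathcal{B}$ is sufficient whenever $\mathcal{A}$ and $\mathcal{B}$
are sufficient and condition (1*) holds.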

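Proof. We first verify that $\mathcal{G}$ is sufficient if and only if $\mathcal{G}_{\mathcal{M}}$ is
sufficient, where $\mathcal{G}\subset\mathcal{F}$ is any sub-σ-field. The "only if" part is trivial.
Suppose $\mathcal{G}_{\mathcal{M}}$ is sufficient, fix $F\in\mathcal{F}$, and take a
$\mathcal{G}_{\mathcal{M}}$-measurable function f which is a version of
$E_Q(I_F\mid\mathcal{G}_{\mathcal{M}})$ for all $Q\in\mathcal{M}$. Since

$\mathcal{G}_{\mathcal{M}}=\{E\in\mathcal{F}:\text{there is }G\in\mathcal{G}\text{ such that }Q(E\Delta G)=0\text{ for all }Q\in\mathcal{M}\}$,

for each n there is a $\mathcal{G}$-measurable function $\phi_n$ such that
$Q(|f-\phi_n|<\tfrac{1}{n})=1$ for all $Q\in\mathcal{M}$. Define $\phi(\omega)=\lim_n\phi_n(\omega)$ if
the limit exists and $\phi(\omega)=0$ otherwise. Then, $\phi$ is $\mathcal{G}$-measurable and
$Q(f=\phi)=1$ for all $Q\in\mathcal{M}$. Thus, $\phi$ is a version of $E_Q(I_F\mid\mathcal{G})$ for all
$Q\in\mathcal{M}$, which shows that $\mathcal{G}$ is sufficient. Next, since $\mathcal{A}$ and
$\mathcal{B}$ are sufficient, $\mathcal{A}_{\mathcal{M}}$ and $\mathcal{B}_{\mathcal{M}}$ are still
sufficient, and thus
$(\mathcal{A}\cap\mathcal{B})_{\mathcal{M}}=\mathcal{A}_{\mathcal{M}}\cap\mathcal{B}_{\mathcal{M}}$ is
sufficient by Theorem 4 of [3]. Therefore, $\mathcal{A}\cap\mathcal{B}$ is sufficient. □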

2.3. Two-component Gibbs sampler. Suppose
$(\Omega,\mathcal{F})=(\mathcal{X}\times\mathcal{Y},\mathcal{U}\otimes\mathcal{V})$ is
the product of two measurable spaces $(\mathcal{X},\mathcal{U})$ and $(\mathcal{Y},\mathcal{V})$ and
let $X:\Omega\to\mathcal{X}$, $Y:\Omega\to\mathcal{Y}$ be the coordinate projections. Suppose also
that regular versions of the conditional distribution of Y given X and X given Y are available
under P (precise definitions are given in Section 4). Roughly speaking, the Gibbs-chain
$(X_n,Y_n)_{n\ge 0}$ can be described as follows. Starting from $\omega=(x,y)$, the next state
$\omega^*=(x^*,y^*)$ is obtained by first choosing $y^*$ from the conditional distribution of Y
given $X=x$ and then $x^*$ from the conditional distribution of X given $Y=y^*$. Iterating this
procedure produces a homogeneous Markov chain $(X_n,Y_n)$ with stationary distribution P. Let
$\mathbb{P}$ denote the law of such a chain when $(X_0,Y_0)\sim P$, and let

$m_n(\phi)=\frac{1}{n}\sum_{i=0}^{n-1}\phi(X_i,Y_i)$

for each function $\phi:\Omega\to\mathbb{R}$.

In real problems, $(X_n,Y_n)$ is constructed mainly for sampling from P. To
this end, it is crucial that the SLLN is available under $\mathbb{P}$, that is,

(3) $m_n(\phi)\to\int\phi\,dP$  $\mathbb{P}$-a.s., for all $\phi\in L_1(P)$.

Note that, under (3), for each probability measure $Q\ll P$ one also obtains

$m_n(\phi)\to\int\phi\,dP$  $\mathbb{Q}$-a.s., for each $\phi\in L_1(P)$,

where $\mathbb{Q}$ is the law of the chain $(X_n,Y_n)$ when $(X_0,Y_0)\sim Q$.

In addition to (3), various other properties are usually required of $(X_n,Y_n)$,
mainly ergodicity, CLT and the convergence rate. Nevertheless, condition (3)
seems (to us) a fundamental one. It is a sort of necessary condition, since the
Gibbs sampling procedure does not make sense without (3). Accordingly, we
say that P is Gibbs-admissible in case (3) holds.
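To make the informal description of the chain and of the averages $m_n(\phi)$ concrete, here is a minimal simulation sketch. It assumes, beyond what the text states, that $\mathcal{X}$ and $\mathcal{Y}$ are finite and that P is given as a matrix `p` of cell probabilities, so that the two conditional distributions are normalized rows and columns; the function name `gibbs_average` and its parameters are hypothetical. The chain is started from a fixed point of positive mass rather than from P.

```python
import random


def gibbs_average(p, phi, start, n, seed=0):
    """Return m_n(phi) = (1/n) * sum_{i<n} phi(X_i, Y_i) along one simulated Gibbs path."""
    rng = random.Random(seed)
    n_x, n_y = len(p), len(p[0])
    x, y = start                      # start must satisfy p[x][y] > 0
    total = 0.0
    for _ in range(n):
        total += phi(x, y)
        # y* is drawn from the conditional law of Y given X = x (row x of p, normalized).
        y = rng.choices(range(n_y), weights=p[x])[0]
        # x* is drawn from the conditional law of X given Y = y* (column y* of p, normalized).
        x = rng.choices(range(n_x), weights=[p[i][y] for i in range(n_x)])[0]
    return total / n


def in_first_row(x, y):
    return 1.0 if x == 0 else 0.0


connected = [[0.2, 0.1], [0.0, 0.7]]      # here P(X = 0) = 0.3
disconnected = [[0.5, 0.0], [0.0, 0.5]]   # here P(X = 0) = 0.5

estimate = gibbs_average(connected, in_first_row, (0, 0), 20000)
stuck = gibbs_average(disconnected, in_first_row, (0, 0), 1000)
```

For the first matrix the average should come out close to P(X = 0) = 0.3. For the second one the chain never leaves the cell (0, 0), so the average equals 1 instead of 0.5 and the SLLN in (3) fails.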

But what about condition (1)? The link is that P turns out to be
Gibbs-admissible precisely when

$\overline{\sigma(X)}\cap\overline{\sigma(Y)}=\mathcal{N}=\overline{\sigma(X)\cap\sigma(Y)}$.

In other terms, the Gibbs sampling procedure makes sense for P (in the
SLLN-sense) if and only if P meets condition (1) with $\mathcal{A}=\sigma(X)$ and
$\mathcal{B}=\sigma(Y)$.

In fact, something more is true. Let K be the transition kernel of $(X_n,Y_n)$
and $S_0=\{\omega\in\Omega:K(\omega,\cdot)\ll P\}$. Under mild conditions ($\mathcal{F}$ countably
generated and P absolutely continuous with respect to a product measure), one
obtains $P(S_0)=1$ and

$\overline{\sigma(X)}\cap\overline{\sigma(Y)}=\mathcal{N}$ $\Leftrightarrow$ $(X_n,Y_n)$ is ergodic on $S_0$.

For proving the previous statements, a key ingredient is a result of Diaconis
et al., connecting the Gibbs-chain $(X_n,Y_n)$ with the Burkholder-Chow
result of Section 2.1; see Theorem 4.1 of [6]. Indeed,
$\overline{\sigma(X)}\cap\overline{\sigma(Y)}=\mathcal{N}$ appears as an assumption in various results
from [6] (Corollary 3.1, Propositions 5.1, 5.2 and 5.3). Also, investigating
$\overline{\sigma(X)}\cap\overline{\sigma(Y)}=\mathcal{N}$ was suggested to us by Persi Diaconis.

One of our main results (Corollary 3.5) is that
$\overline{\sigma(X)}\cap\overline{\sigma(Y)}=\mathcal{N}$ is equivalent to the following property of P:

$P(X\in U)=0$ or $P(Y\in V)=0$

whenever $U\in\mathcal{U}$, $V\in\mathcal{V}$ and $P(U\times V)=P(U^c\times V^c)=0$.

This paper is organized into two parts. Section 3 gives general results on
conditions (1) and (1*). It includes characterizations, examples and various
working sufficient conditions in case P is absolutely continuous with respect
to a product measure. Under this assumption, it is also shown that P is
atomic on $\overline{\sigma(X)}\cap\overline{\sigma(Y)}$ (with X and Y as in Section 2.3). The main
results are Theorems 3.1 and 3.10 and Corollaries 3.5, 3.7 and 3.9. Section 4 deals
with the Gibbs sampler and contains the material sketched above. The main
results are Theorems 4.2, 4.4 and 4.5.

3. General results. This section is split into three subsections. All ex- 
amples are postponed to the last one. 

3.1. Necessary and sufficient conditions. Condition (1) admits a surpris- 
ingly simple characterization. 

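Theorem 3.1. Let

$\mathcal{J}=\{A\cap B:A\in\mathcal{A},\ B\in\mathcal{B}\text{ and }P(A\cap B)+P(A^c\cap B^c)=1\}$.

Then, $\overline{\mathcal{A}}\cap\overline{\mathcal{B}}=\overline{\mathcal{J}}$. Moreover,
$\overline{\mathcal{A}}\cap\overline{\mathcal{B}}=\overline{\mathcal{A}\cap\mathcal{B}}$ if and only if

(4) $A\in\mathcal{A}$, $B\in\mathcal{B}$ and $P(A\cap B)=P(A^c\cap B^c)=0$
    implies $P(A\Delta D)=0$ or $P(B\Delta D)=0$ for some $D\in\mathcal{A}\cap\mathcal{B}$.

Proof. First note that, for any sub-σ-field $\mathcal{G}\subset\mathcal{F}$, one has

$\overline{\mathcal{G}}=\{F\in\mathcal{F}:P(F\Delta G)=0\text{ for some }G\in\mathcal{G}\}$.

Let $F\in\overline{\mathcal{A}}\cap\overline{\mathcal{B}}$. Then, $P(A\Delta F)=P(B\Delta F)=0$, for
some $A\in\mathcal{A}$ and $B\in\mathcal{B}$, and

$1-P(A\cap B)-P(A^c\cap B^c)=P(A\Delta B)\le P(A\Delta F)+P(B\Delta F)=0$.

Hence, $J:=A\cap B\in\mathcal{J}$. Since $P(F\Delta J)\le P(A\Delta F)+P(B\Delta F)=0$, then
$F\in\overline{\mathcal{J}}$. Conversely, let $J=A\cap B\in\mathcal{J}$ where $A\in\mathcal{A}$,
$B\in\mathcal{B}$ and $P(A\cap B)+P(A^c\cap B^c)=1$. Define $H=(A\cap B)\cup(A^c\cap B^c)$. Since
$P(H)=1$ and $J=A\cap H=B\cap H$, it follows that $J\in\overline{\mathcal{A}}\cap\overline{\mathcal{B}}$.
Therefore, $\overline{\mathcal{A}}\cap\overline{\mathcal{B}}=\overline{\mathcal{J}}$. In particular,
$\overline{\mathcal{A}}\cap\overline{\mathcal{B}}=\overline{\mathcal{A}\cap\mathcal{B}}$ if and only if
$\mathcal{J}\subset\overline{\mathcal{A}\cap\mathcal{B}}$, and thus it remains only to show that
$\mathcal{J}\subset\overline{\mathcal{A}\cap\mathcal{B}}$ is equivalent to condition (4). Suppose (4)
holds and fix $J\in\mathcal{J}$. Then, J can be written as $J=A\cap B^c$ for some $A\in\mathcal{A}$ and
$B\in\mathcal{B}$ with $P(A\cap B)=P(A^c\cap B^c)=0$. By (4), it follows that
$A\in\overline{\mathcal{A}\cap\mathcal{B}}$ or $B\in\overline{\mathcal{A}\cap\mathcal{B}}$, say
$A\in\overline{\mathcal{A}\cap\mathcal{B}}$. Since $P(A\cap B)=0$, one obtains
$J=A\cap B^c=A-(A\cap B)\in\overline{\mathcal{A}\cap\mathcal{B}}$. Finally, suppose
$\mathcal{J}\subset\overline{\mathcal{A}\cap\mathcal{B}}$ and fix $A\in\mathcal{A}$ and
$B\in\mathcal{B}$ with $P(A\cap B)=P(A^c\cap B^c)=0$. Since
$A\cap B^c\in\mathcal{J}\subset\overline{\mathcal{A}\cap\mathcal{B}}$ and $P(A\cap B)=0$, then
$A=(A\cap B)\cup(A\cap B^c)\in\overline{\mathcal{A}\cap\mathcal{B}}$, that is, $P(A\Delta D)=0$
for some $D\in\mathcal{A}\cap\mathcal{B}$. □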

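The case of more than two sub-σ-fields can be settled essentially as in Theorem
3.1.

Theorem 3.2. Let $\mathcal{A}_1,\ldots,\mathcal{A}_k\subset\mathcal{F}$ be sub-σ-fields, $k\ge 2$. Then,

$\bigcap_{i=1}^{k}\overline{\mathcal{A}_i}=\overline{\bigcap_{i=1}^{k}\mathcal{A}_i}$

if and only if

$A_i\in\mathcal{A}_i$ for all i and $P\bigl(\bigcap_{i=1}^{k}A_i\bigr)+P\bigl(\bigcap_{i=1}^{k}A_i^c\bigr)=1$

implies $P(A_i\Delta D)=0$ for some i and $D\in\bigcap_{i=1}^{k}\mathcal{A}_i$.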

The simple argument which leads to Theorem 3.1 also yields conditions
for sufficiency of $\mathcal{A}\cap\mathcal{B}$.

Theorem 3.3. In the notation of Section 2.2,
$\mathcal{A}_{\mathcal{M}}\cap\mathcal{B}_{\mathcal{M}}=(\mathcal{A}\cap\mathcal{B})_{\mathcal{M}}$ if
and only if

(4*) $A\in\mathcal{A}$, $B\in\mathcal{B}$ and $Q(A\cap B)=Q(A^c\cap B^c)=0$ for all $Q\in\mathcal{M}$
     implies the existence of $D\in\mathcal{A}\cap\mathcal{B}$ such that
     $Q(A\Delta D)=0$ for all $Q\in\mathcal{M}$ or $Q(B\Delta D)=0$ for all $Q\in\mathcal{M}$.

Hence, by Corollary 2.2 and Theorem 3.3, $\mathcal{A}\cap\mathcal{B}$ is sufficient whenever
$\mathcal{A}$ and $\mathcal{B}$ are sufficient and condition (4*) holds.

The proofs of both Theorems 3.2 and 3.3 have been omitted since they
are quite analogous to that of Theorem 3.1.

Let us come back to the main concern of this paper, that is, a single
probability measure P and two sub-σ-fields $\mathcal{A}$ and $\mathcal{B}$. Condition (4) of Theorem
3.1 trivially holds provided

(5) $A\in\mathcal{A}$, $B\in\mathcal{B}$ and $P(A\cap B)=P(A^c\cap B^c)=0$
    implies $P(A)=0$ or $P(B)=0$.

Generally, condition (5) is stronger than (4) [just take $\mathcal{A}=\mathcal{B}$, so that (4)
trivially holds, and choose P such that (5) fails]. However, (4) and (5) are
equivalent in a significant particular case.

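Corollary 3.4. If $\mathcal{A}\cap\mathcal{B}\subset\mathcal{N}$, then

$\overline{\mathcal{A}}\cap\overline{\mathcal{B}}=\mathcal{N}$ $\Leftrightarrow$ condition (4) holds $\Leftrightarrow$ condition (5) holds.

Proof. Suppose (4) holds and fix $A\in\mathcal{A}$, $B\in\mathcal{B}$ with
$P(A\cap B)=P(A^c\cap B^c)=0$. Then, $P(A\Delta D)\,P(B\Delta D)=0$ for some
$D\in\mathcal{A}\cap\mathcal{B}$, say $P(A\Delta D)=0$. Since $\mathcal{A}\cap\mathcal{B}\subset\mathcal{N}$,
one obtains $P(A)=P(D)\in\{0,1\}$. If $P(A)=0$, then $P(A)P(B)=0$. If $P(A)=1$, then
$P(B)=P(A\cap B)=0$ and again $P(A)P(B)=0$. Thus, (5) holds. Since
$\overline{\mathcal{A}\cap\mathcal{B}}=\mathcal{N}$, an application of Theorem 3.1 concludes
the proof. □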

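In the sequel, we deal with product measurable spaces. Let $(\mathcal{X},\mathcal{U})$ and
$(\mathcal{Y},\mathcal{V})$ be measurable spaces and

$(\Omega,\mathcal{F})=(\mathcal{X}\times\mathcal{Y},\mathcal{U}\otimes\mathcal{V})$, $X(x,y)=x$, $Y(x,y)=y$,

where $(x,y)\in\mathcal{X}\times\mathcal{Y}$. We focus on $\mathcal{A}=\sigma(X)$ and
$\mathcal{B}=\sigma(Y)$ and we let

$\mathcal{D}=\overline{\sigma(X)}\cap\overline{\sigma(Y)}$.

On noting that $\sigma(X)\cap\sigma(Y)=\{\emptyset,\Omega\}$, Corollary 3.4 implies the following
statement.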

Corollary 3.5. $\mathcal{D}=\mathcal{N}$ if and only if

(6) $U\in\mathcal{U}$, $V\in\mathcal{V}$ and $P(U\times V)=P(U^c\times V^c)=0$
    implies $P(X\in U)=0$ or $P(Y\in V)=0$.
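On a finite product space, condition (6) can be checked by exhausting all rectangles. The sketch below is an illustration under assumptions not made in the text: $\mathcal{X}$ and $\mathcal{Y}$ are small finite sets, $\mathcal{U}$ and $\mathcal{V}$ are their power sets, P is a matrix `p` of cell probabilities, and the name `condition_6_holds` is hypothetical. The search is exponential in the number of points, so it is only meant for toy cases.

```python
from itertools import combinations


def subsets(n):
    """Yield every subset of {0, ..., n-1} as a set."""
    for size in range(n + 1):
        for combo in combinations(range(n), size):
            yield set(combo)


def condition_6_holds(p):
    """Brute-force check of condition (6) for a finite joint distribution p[x][y]."""
    n_x, n_y = len(p), len(p[0])
    all_x, all_y = set(range(n_x)), set(range(n_y))

    def mass(us, vs):
        return sum(p[i][j] for i in us for j in vs)

    for u in subsets(n_x):
        for v in subsets(n_y):
            # Hypothesis of (6): P(U x V) = P(U^c x V^c) = 0.
            if mass(u, v) == 0 and mass(all_x - u, all_y - v) == 0:
                # Conclusion of (6) fails if both marginal masses are positive.
                if mass(u, all_y) > 0 and mass(all_x, v) > 0:
                    return False
    return True


connected = [[0.2, 0.1], [0.0, 0.7]]
disconnected = [[0.5, 0.0], [0.0, 0.5]]

ok_connected = condition_6_holds(connected)
ok_disconnected = condition_6_holds(disconnected)
```

For the second matrix the rectangle U = {0}, V = {1} violates (6): both $U\times V$ and $U^c\times V^c$ are null, yet P(X ∈ U) = P(Y ∈ V) = 1/2.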

By Corollary 3.5, if $\mathcal{D}\neq\mathcal{N}$, then $\mathcal{D}$ includes a rectangle $U\times V$
such that $U\in\mathcal{U}$, $V\in\mathcal{V}$ and $0<P(U\times V)<1$. This fact implies a first
sufficient condition for $\mathcal{D}=\mathcal{N}$.

Corollary 3.6. For $\mathcal{D}=\mathcal{N}$, it is sufficient that

(7) $E\{E(I_{U\times V}\mid\sigma(X))\mid\sigma(Y)\}+E\{E(I_{U\times V}\mid\sigma(Y))\mid\sigma(X)\}>0$ a.s.

whenever $U\in\mathcal{U}$, $V\in\mathcal{V}$ and $0<P(U\times V)<1$.

Proof. Let $U\in\mathcal{U}$ and $V\in\mathcal{V}$. If $U\times V\in\mathcal{D}$, then

$E\{E(I_{U\times V}\mid\sigma(X))\mid\sigma(Y)\}+E\{E(I_{U\times V}\mid\sigma(Y))\mid\sigma(X)\}=2\,I_{U\times V}$ a.s.

Thus, (7) implies $U\times V\notin\mathcal{D}$ in case $0<P(U\times V)<1$. □

3.2. Sufficient conditions in case P is absolutely continuous with respect
to a product measure. In real problems, P usually has a density with respect
to some product measure. Let $\mu$ and $\nu$ be σ-finite measures on $\mathcal{U}$ and $\mathcal{V}$,
respectively. In this subsection, $P\ll\mu\times\nu$ and f is a version of the density
of P with respect to $\mu\times\nu$.

Corollary 3.7. For $\mathcal{D}=\mathcal{N}$, it is sufficient that $P\ll\mu\times\nu$ and

$(U_0\times\mathcal{Y})\cup(\mathcal{X}\times V_0)\supset\{f>0\}\supset U_0\times V_0$

for some $U_0\in\mathcal{U}$ and $V_0\in\mathcal{V}$ such that $P(U_0\times V_0)>0$.

Proof. By Corollary 3.5, it suffices to prove condition (6). Let $U\in\mathcal{U}$
and $V\in\mathcal{V}$ be such that $P(U\times V)=P(U^c\times V^c)=0$. Since $f>0$ on $U_0\times V_0$
and

$\int_{(U\cap U_0)\times(V\cap V_0)}f\,d(\mu\times\nu)=P((U\cap U_0)\times(V\cap V_0))\le P(U\times V)=0$,

it follows that

$\mu(U\cap U_0)\,\nu(V\cap V_0)=\mu\times\nu((U\cap U_0)\times(V\cap V_0))=0$,

say $\mu(U\cap U_0)=0$. Similarly, $P(U^c\times V^c)=0$ and $f>0$ on $U_0\times V_0$ imply
$\mu(U_0-U)\,\nu(V_0-V)=0$. Since $P(X\in U_0)>0$ and $\mu(U\cap U_0)=0$, it must
be $\mu(U_0-U)>0$ and thus $\nu(V_0-V)=0$. Let $H_0=(U_0\times\mathcal{Y})\cup(\mathcal{X}\times V_0)$.
Since $P(H_0)=1$ and $P(X\in U\cap U_0)=P(Y\in V_0-V)=0$, one obtains

$P(X\in U)=P(\{X\in U-U_0\}\cap H_0)=P((U-U_0)\times V_0)$
$\qquad\quad\ =P((U-U_0)\times(V\cap V_0))\le P(U\times V)=0$.

Therefore, condition (6) holds. □

Corollary 3.7 applies in particular if $\{f>0\}\supset U_0\times\mathcal{Y}$ for some
$U_0\in\mathcal{U}$ with $P(X\in U_0)>0$ (just take $V_0=\mathcal{Y}$). Likewise, it applies if
$\{f>0\}\supset\mathcal{X}\times V_0$ for some $V_0\in\mathcal{V}$ such that $P(Y\in V_0)>0$.

Let $\mu_0$ be a probability measure on $\mathcal{U}$ equivalent to $\mu$, that is, $\mu_0\ll\mu$
and $\mu\ll\mu_0$. Similarly, let $\nu_0$ be a probability measure on $\mathcal{V}$ equivalent to $\nu$.
Then, $\mu_0\times\nu_0$ is equivalent to $\mu\times\nu$ and, for each $H\in\mathcal{F}$ with
$\mu\times\nu(H)>0$, one can define the probability measure

$Q_H(\cdot)=\dfrac{\mu_0\times\nu_0(\,\cdot\,\cap H)}{\mu_0\times\nu_0(H)}$.

Say that H has the trivial intersection property, or briefly that H is TIP,
in case $H\in\mathcal{F}$, $\mu\times\nu(H)>0$ and $\mathcal{D}=\mathcal{N}$ holds when $P=Q_H$. Note
that whether or not H is TIP does not depend on the choice of $\mu_0$ and $\nu_0$. Note
also that

$\mathcal{D}=\mathcal{N}$ $\Leftrightarrow$ the set $\{f>0\}$ is TIP.



Corollary 3.8. Suppose $P\ll\mu\times\nu$ and $\{f>0\}=\bigcup_n H_n$, where
$H_1\subset H_2\subset\cdots$ is an increasing sequence of TIP sets. Then, $\mathcal{D}=\mathcal{N}$.

Proof. Let $H=\{f>0\}$ and $U\in\mathcal{U}$, $V\in\mathcal{V}$ with
$Q_H(U\times V)=Q_H(U^c\times V^c)=0$. Since $H_n\subset H$, then
$Q_{H_n}(U\times V)=Q_{H_n}(U^c\times V^c)=0$ for all n. Since $H_n$ is TIP, Corollary 3.5
(applied to $Q_{H_n}$) yields $Q_{H_n}(X\in U)\,Q_{H_n}(Y\in V)=0$ for all n. Thus,
$H_n\uparrow H$ implies

$Q_H(X\in U)\,Q_H(Y\in V)=\lim_n Q_{H_n}(X\in U)\,Q_{H_n}(Y\in V)=0$.

By Corollary 3.5, $H=\{f>0\}$ is TIP. □

In order to generalize Corollary 3.8, one more piece of terminology is
useful. Given two sets $F,G\in\mathcal{F}$, say that F communicates with G in case at
least one of the following conditions (i) and (ii) holds:

(i) there is $V_0\in\mathcal{V}$ with $\nu(V_0)>0$ and $\mu(F^y)>0$, $\mu(G^y)>0$ for all $y\in V_0$,

(ii) there is $U_0\in\mathcal{U}$ with $\mu(U_0)>0$ and $\nu(F_x)>0$, $\nu(G_x)>0$ for all $x\in U_0$,

where $F_x=\{y\in\mathcal{Y}:(x,y)\in F\}$ and $F^y=\{x\in\mathcal{X}:(x,y)\in F\}$ denote the
sections of F.

Corollary 3.9. Suppose $P\ll\mu\times\nu$ and $\{f>0\}=\bigcup_n H_n$, where $H_n$
is TIP and $H_n$ communicates with $H_{n+1}$ for each n. Then, $\mathcal{D}=\mathcal{N}$.

Proof. It is enough to prove that $F\cup G$ is TIP whenever F and G
are TIP and F communicates with G. In that case, in fact, since $\bigcup_{i=1}^{n-1}H_i$
communicates with $H_n$, a simple induction implies that $\bigcup_{i=1}^{n}H_i$ is TIP for
all n. Hence, $\mathcal{D}=\mathcal{N}$ by Corollary 3.8.

Suppose F and G are TIP and condition (i) holds [the proof is the same
if (ii) holds]. Set $H=F\cup G$ and fix $U\in\mathcal{U}$, $V\in\mathcal{V}$ with
$Q_H(U\times V)=Q_H(U^c\times V^c)=0$. Since F and G are TIP and

$Q_F(U\times V)=Q_F(U^c\times V^c)=Q_G(U\times V)=Q_G(U^c\times V^c)=0$,

one obtains

$Q_F(X\in U)=0$ or $Q_F(Y\in V)=0$, and
$Q_G(X\in U)=0$ or $Q_G(Y\in V)=0$.

Let $V_0$ be as in condition (i). If $Q_F(X\in U)=0$, then $Q_F(Y\in V)=1$. By
(i) and since $\mu_0$ and $\nu_0$ are equivalent to $\mu$ and $\nu$, it follows that

$Q_F(Y\in V\cap V_0)=Q_F(Y\in V_0)=\dfrac{\int_{V_0}\mu_0(F^y)\,\nu_0(dy)}{\mu_0\times\nu_0(F)}>0$.

Hence $\nu_0(V\cap V_0)>0$, and this implies

$Q_G(Y\in V)\ge Q_G(Y\in V\cap V_0)=\dfrac{\int_{V\cap V_0}\mu_0(G^y)\,\nu_0(dy)}{\mu_0\times\nu_0(G)}>0$.

Therefore, $Q_F(X\in U)=0$ implies $Q_G(Y\in V)>0$, and similarly $Q_G(X\in U)=0$
implies $Q_F(Y\in V)>0$. It follows that $Q_F(X\in U)=Q_G(X\in U)=0$
or $Q_F(Y\in V)=Q_G(Y\in V)=0$, which implies $Q_H(X\in U)=0$ or $Q_H(Y\in V)=0$.
Thus, condition (6) holds for $Q_H$, and $H=F\cup G$ is TIP by Corollary
3.5. □



So far, conditions for P to be 0-1-valued on $\mathcal{D}$ have been given. A weaker
but useful result is that the latter property holds locally, in the sense that
$\Omega$ can be partitioned into sets $H_1,H_2,\ldots\in\mathcal{D}$ such that $P(H_n)>0$ and
$P(\cdot\mid H_n)$ is 0-1-valued on $\mathcal{D}$ for each n. We now prove that this is always
true provided $P\ll\mu\times\nu$. In that case, in fact, P is atomic on $\mathcal{D}$. Recall
that, given a probability space $(\mathcal{Z},\mathcal{E},Q)$, a Q-atom is a set $K\in\mathcal{E}$ such
that $Q(K)>0$ and $Q(\cdot\mid K)$ is 0-1-valued. In general, there are three possible
situations: (j) Q is nonatomic, that is, there are no Q-atoms; (jj) Q is atomic,
that is, the Q-atoms form a partition of $\mathcal{Z}$; (jjj) there is $K\in\mathcal{E}$, $0<Q(K)<1$,
such that $Q(\cdot\mid K)$ is nonatomic and $K^c$ is a disjoint union of Q-atoms.

Theorem 3.10. If $P\ll\mu\times\nu$, then P is atomic on $\mathcal{D}$ (i.e., the
restriction of P to $\mathcal{D}$ is atomic).

Proof. Fix $H\in\mathcal{D}$ with $P(H)>0$ and let $P_H$ denote the restriction
of $P(\cdot\mid H)$ to $\mathcal{D}$. If $P_H$ is nonatomic, the probability space
$(\Omega,\mathcal{D},P_H)$ supports a real random variable with uniform distribution on (0,1); see,
for example, Theorem 3.1 of [1]. Hence, it suffices to prove that each $\mathcal{D}$-measurable
function $Z:\Omega\to\mathbb{R}$ satisfies $P_H(Z\in C)=1$ for some countable set
$C\subset\mathbb{R}$. Let $Z:\Omega\to\mathbb{R}$ be $\mathcal{D}$-measurable. Since
$\sigma(Z)\subset\overline{\sigma(X)}$, there is a $\mathcal{U}$-measurable function
$h:\mathcal{X}\to\mathbb{R}$ satisfying $Z=h(X)$ a.s. Similarly,
$\sigma(Z)\subset\overline{\sigma(Y)}$ yields $Z=k(Y)$ a.s. for some $\mathcal{V}$-measurable function
$k:\mathcal{Y}\to\mathbb{R}$. Let $C=\{c\in\mathbb{R}:\nu\{y:k(y)=c\}>0\}$. Since $\nu$ is σ-finite,
C is countable and

$\mu\times\nu\{(x,y):h(x)\notin C,\ h(x)=k(y)\}=\int_{\{x:h(x)\notin C\}}\nu\{y:k(y)=h(x)\}\,\mu(dx)=0$.

Since $P\ll\mu\times\nu$ and $Z=h(X)=k(Y)$ a.s., it follows that

$P(Z\in C)=1-P(h(X)\notin C,\ h(X)=k(Y))=1$.

Thus, $P_H\ll P$ implies $P_H(Z\in C)=1$. This concludes the proof. □

3.3. Examples. In this subsection, $\mathcal{X}$ and $\mathcal{Y}$ are topological spaces and
$\mathcal{U}$ and $\mathcal{V}$ the corresponding Borel σ-fields. Moreover, $\mu$ and $\nu$ have full
topological support (i.e., they are strictly positive on nonempty open sets) and
P has a density f with respect to $\mu\times\nu$.

We note that, since $\mu$ and $\nu$ have full topological support, F communicates
with G whenever $F,G\in\mathcal{F}$ and $F\cap G$ has nonempty interior. Further, by
Corollary 3.9 (see also its proof), $F\cup G$ is TIP whenever F and G are TIP
and F communicates with G.

Example 3.11. Let $\mathcal{X}$ and $\mathcal{Y}$ be second countable topological spaces.
If $\{f>0\}$ is open and connected, then $\mathcal{D}=\mathcal{N}$.

Suppose in fact that $H\subset\Omega$ is open and connected. Since H is open,
$H=\bigcup_n H_n$ where each $H_n$ is open and TIP (for instance, take the $H_n$ as
open rectangles). For $\omega_1,\omega_2\in H$, say that $\omega_1\sim\omega_2$ in case there are a
finite number of indices $j_1,\ldots,j_n$ such that $\omega_1\in H_{j_1}$, $\omega_2\in H_{j_n}$ and
$H_{j_i}\cap H_{j_{i+1}}\neq\emptyset$ for each i. Then, $\sim$ is an equivalence relation on H. Since
H is connected and the equivalence classes of $\sim$ are open, there is precisely one equivalence
class, that is, $\omega_1\sim\omega_2$ for all $\omega_1,\omega_2\in H$. Fix $\omega_0\in H$. For each
k, take $\omega_k\in H_k$ and define $M_k=H_k\cup\bigl(\bigcup_{i=1}^{n}H_{j_i}\bigr)$ where
$j_1,\ldots,j_n$ are such that $\omega_k\in H_{j_1}$, $\omega_0\in H_{j_n}$ and
$H_{j_i}\cap H_{j_{i+1}}\neq\emptyset$ for all i. By Corollary 3.9, $M_k$ is TIP. Thus,
$H=\bigcup_k M_k$ is TIP as well, since $\omega_0\in M_k\cap M_{k+1}$ for each k.

Note that $\{f>0\}$ is open provided f is lower semicontinuous. Thus, when
f is lower semicontinuous (and $\mathcal{X}$, $\mathcal{Y}$ are second countable and locally
connected), a sufficient condition for $\mathcal{D}=\mathcal{N}$ is that the connected components
$H_1,H_2,\ldots$ of $\{f>0\}$ can be arranged in such a way that $H_n$ communicates
with $H_{n+1}$ for all n. This follows from Example 3.11 and Corollary 3.9. Note
also that, in case $\mathcal{X}=\mathbb{R}^n$, $\mathcal{Y}=\mathbb{R}^m$ and $\mu$, $\nu$ the Lebesgue
measures, P admits a lower semicontinuous density provided it admits a Riemann-integrable
density.

Example 3.12. Let $\mathcal{X}=\mathbb{R}^n$, $\mathcal{Y}=\mathbb{R}^m$ and $\mu$, $\nu$ the
Lebesgue measures. If $\{f>0\}$ is convex, then $\mathcal{D}=\mathcal{N}$.

Suppose in fact that $H\in\mathcal{F}$ is convex and $\mu\times\nu(H)>0$. Since H is convex,
$\mu\times\nu(H-H^\circ)\le\mu\times\nu(\partial H)=0$ where $H^\circ$ and $\partial H$ are the
interior and the boundary of H. Thus, it suffices to note that $H^\circ$ is open and connected.

In various frameworks, for instance in Bayesian statistics, P is usually a 
mixture of some other probability laws. 

Example 3.13. Let $(\Theta,\mathcal{E},\pi)$ be a probability space and
$\{P_\theta:\theta\in\Theta\}$ a collection of probabilities on $\mathcal{F}$ such that
$\theta\mapsto P_\theta(H)$ is $\mathcal{E}$-measurable for fixed $H\in\mathcal{F}$. Define

$P=\int P_\theta\,\pi(d\theta)$.

Then, condition (6) can be written as

(8) $U\in\mathcal{U}$, $V\in\mathcal{V}$ and $P_\theta(U\times V)=P_\theta(U^c\times V^c)=0$, $\pi$-a.s.,
    implies $P_\theta(X\in U)=0$, $\pi$-a.s. or $P_\theta(Y\in V)=0$, $\pi$-a.s.

Several sufficient conditions for $\mathcal{D}=\mathcal{N}$ can be deduced from (8). For
instance, $\mathcal{D}=\mathcal{N}$ provided each $P_\theta$ meets condition (6) [i.e., (6) holds when
$P=P_\theta$] and

(9) $\pi\{\theta:P_\theta(U\times V)=1\}>0\ \Rightarrow\ P_\theta(U\times V)>0$, $\pi$-a.s.,

for all $U\in\mathcal{U}$ and $V\in\mathcal{V}$. Suppose in fact
$P_\theta(U\times V)=P_\theta(U^c\times V^c)=0$, $\pi$-a.s., for some $U\in\mathcal{U}$ and
$V\in\mathcal{V}$. Since each $P_\theta$ meets (6), then $P_\theta(U\times V^c)\in\{0,1\}$,
$\pi$-a.s. If $P_\theta(U\times V^c)=0$, $\pi$-a.s., then $P_\theta(X\in U)=0$, $\pi$-a.s. Otherwise, if

$\pi\{\theta:P_\theta(U\times V^c)=1\}=\pi\{\theta:P_\theta(U\times V^c)>0\}>0$,

condition (9) yields $P_\theta(U\times V^c)>0$, $\pi$-a.s. Hence, $P_\theta(Y\in V)=0$, $\pi$-a.s. One
more sufficient condition for $\mathcal{D}=\mathcal{N}$ applies when each $P_\theta$ has a density
$f_\theta$ with respect to $\mu\times\nu$. In that case, $\mathcal{D}=\mathcal{N}$ whenever

(10) $\{f_\theta>0\}$ is TIP for all $\theta$ and
     $\{f_{\theta_1}>0\}$ communicates with $\{f_{\theta_2}>0\}$ for all $\theta_1,\theta_2$.

Suppose in fact $P_{\theta_i}(U\times V)=P_{\theta_i}(U^c\times V^c)=0$, $i=1,2$, for some
$U\in\mathcal{U}$, $V\in\mathcal{V}$ and $\theta_1,\theta_2\in\Theta$. By (10),

$\{f_{\theta_1}>0\}\cup\{f_{\theta_2}>0\}$ is TIP.

Using this fact, it is straightforward to verify that
$P_{\theta_1}(X\in U)=P_{\theta_2}(X\in U)=0$ or $P_{\theta_1}(Y\in V)=P_{\theta_2}(Y\in V)=0$.
Therefore, condition (8) holds.

The next two examples answer questions posed by Persi Diaconis and David
Freedman.

Example 3.14. Let $(\mathcal{X},d_1)$ and $(\mathcal{Y},d_2)$ be metric spaces and
$\Omega=\mathcal{X}\times\mathcal{Y}$ be equipped with any one of the usual distances

$d(\omega,\omega^*)=\sqrt{d_1(x,x^*)^2+d_2(y,y^*)^2}$, $d(\omega,\omega^*)=d_1(x,x^*)\vee d_2(y,y^*)$,
$d(\omega,\omega^*)=d_1(x,x^*)+d_2(y,y^*)$, where $\omega=(x,y)$ and $\omega^*=(x^*,y^*)$.

By Corollary 3.7, under any such d, the balls in $\Omega$ are TIP. Let $D_1\subset\mathcal{X}$ and
$D_2\subset\mathcal{Y}$ be countable subsets and $(x_1,y_1),(x_2,y_2),\ldots$ any enumeration of
the points of $D_1\times D_2$. Suppose $\{f>0\}=\bigcup_n H_n$, where $H_n\in\mathcal{F}$ is an open
ball centered at $(x_n,y_n)$. For some k, the ball $H_k$ is centered at $(x_1,y_2)$.
Then, $H_1$ communicates with $H_k$ and $H_k$ communicates with $H_2$, and we
let $j_1=1$, $j_2=k$ and $j_3=2$. Next, for some m, the ball $H_m$ is centered at
$(x_2,y_3)$. Then, $H_2$ communicates with $H_m$ and $H_m$ communicates with $H_3$,
and we let $j_4=m$ and $j_5=3$. Arguing in this way, $\{f>0\}$ can be written
as $\{f>0\}=\bigcup_n H_{j_n}$ where $H_{j_n}$ communicates with $H_{j_{n+1}}$ for all n. Since
each $H_{j_n}$ is TIP, Corollary 3.9 implies $\mathcal{D}=\mathcal{N}$.

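Example 3.15. In the notation of Example 3.14, suppose
$\{f>0\}=\bigl(\bigcup_n H_n\bigr)^c$. Let $I_n=\{x:(x,y)\in H_n\text{ for some }y\}$ be the projection
of $H_n$ on $\mathcal{X}$. Since $H_n$ is open, $I_n$ is open as well. Suppose also that
$\sum_n\mu(I_n)<\mu(\mathcal{X})$. Then, Corollary 3.7 yields $\mathcal{D}=\mathcal{N}$. In fact,
$\{f=0\}\subset\bigl(\bigcup_n I_n\bigr)\times\mathcal{Y}$ and
$\mu\bigl(\bigcup_n I_n\bigr)\le\sum_n\mu(I_n)<\mu(\mathcal{X})$. Letting
$U_0=\mathcal{X}-\bigl(\bigcup_n I_n\bigr)$, thus, one obtains
$\{f>0\}\supset U_0\times\mathcal{Y}$ and $P(X\in U_0)>0$.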

Let us turn now to $\mathcal{D}\neq\mathcal{N}$. Generally, the complement of a TIP set need
not be TIP. One consequence is that, in spite of Corollary 3.8, the intersection
of a decreasing sequence of TIP sets need not be TIP.

Example 3.16. Let $\mathcal{X}=\mathcal{Y}=(0,1)$, $\mu=\nu=$ Lebesgue measure,
$F=(0,\tfrac12)\times(0,\tfrac12)$ and $G_n=(\tfrac12-\tfrac1n,1)\times(\tfrac12,1)$. Since F and
$G_n$ are TIP and F communicates with $G_n$, then $H_n=F\cup G_n$ is TIP. Further, $H_n$ is a
decreasing sequence of sets. However,

$H=\bigcap_n H_n=F\cup\bigl([\tfrac12,1)\times(\tfrac12,1)\bigr)$

is not TIP. In fact, $0<Q_H(F)<1$, $Q_H(H)=1$ and

$F=\bigl((0,\tfrac12)\times(0,1)\bigr)\cap H=\bigl((0,1)\times(0,\tfrac12)\bigr)\cap H$.

Finally, we exhibit a situation where $\mathcal{D}\neq\mathcal{N}$ though P is absolutely
continuous (with respect to Lebesgue measure) and has full topological support.

Example 3.17. Let $\mathcal{X}=\mathcal{Y}=(0,1)$ and $\mu=\nu=$ Lebesgue measure. Suppose
$\{f>0\}=\{(x,y):x,y\in I\text{ or }x,y\in(0,1)-I\}$, where I is a Borel subset
of (0,1) satisfying

$0<\mu(I\cap J)<\mu(J)$ for each nonempty open set $J\subset(0,1)$.

Since $0<P(I\times I)<1$ and

$I\times I=(I\times(0,1))\cap\{f>0\}=((0,1)\times I)\cap\{f>0\}$,

then $\mathcal{D}\neq\mathcal{N}$. Moreover, $P(J_1\times J_2)\ge P((I\cap J_1)\times(I\cap J_2))>0$
whenever $J_1,J_2\subset(0,1)$ are nonempty open sets, since $\mu(I\cap J_i)>0$ for $i=1,2$. Thus,
P has full topological support.

4. Two-component Gibbs sampler. The Gibbs sampler, also known as 
Glauber dynamics, plays an important role in scientific computing. A de- 
tailed treatment can be found in various papers or textbooks; see, for ex- 
ample, [5, 7, 8, 10] and references therein. In this section, the Gibbs-chain 
is shown to meet the SLLN (Gibbs-admissibility of P) if and only if
condition (6) holds. Moreover, under mild conditions ($\mathcal{F}$ countably generated and
$P\ll\mu\times\nu$), condition (6) is also equivalent to ergodicity of the Gibbs-chain
on a certain subset $S_0\subset\Omega$.

In order to define the Gibbs sampler, Y is assumed to admit a regular
version of the conditional distribution given X, say $\alpha=\{\alpha(x):x\in\mathcal{X}\}$. Thus:
(i) $\alpha(x)$ is a probability measure on $\mathcal{V}$ for $x\in\mathcal{X}$; (ii)
$x\mapsto\alpha(x)(V)$ is $\mathcal{U}$-measurable for $V\in\mathcal{V}$; (iii)
$P(U\times V)=\int_U\alpha(x)(V)\,P\circ X^{-1}(dx)$ for $U\in\mathcal{U}$ and $V\in\mathcal{V}$.
Similarly, X is supposed to admit a regular version of the
conditional distribution given Y, say $\beta=\{\beta(y):y\in\mathcal{Y}\}$.

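The Gibbs-chain $(X_n,Y_n)_{n\ge 0}$ has been informally described in Section
2.3. Formally, $(X_n,Y_n)$ is the homogeneous Markov chain with state space
$(\Omega,\mathcal{F})$ and transition kernel

$K(\omega,U\times V)=\int_V\beta(b)(U)\,\alpha(x)(db)$,

where $U\in\mathcal{U}$, $V\in\mathcal{V}$ and $\omega=(x,y)\in\Omega$.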

Note that P is a stationary distribution for the chain $(X_n,Y_n)$. Denote by $\mathbb{P}$
the law of $(X_n,Y_n)$ when $(X_0,Y_0)\sim P$, and by $\mathbb{P}_\omega$ the law of $(X_n,Y_n)$ given
that $(X_0,Y_0)=\omega$.

Any distributional requirement of $(X_n,Y_n)$ (such as SLLN, CLT, ergodicity,
rate of convergence) depends only on the choice of the conditional
distributions $\alpha$ and $\beta$. At least if $\mathcal{U}$ and $\mathcal{V}$ are countably
generated, however, $\alpha$ and $\beta$ are determined by P up to null sets. Thus, one can try to
characterize properties of $(X_n,Y_n)$ via properties of P. Here, we first focus
on the SLLN and then on ergodicity. Recall that, in Section 2.3, P has been
called Gibbs-admissible if

$m_n(\phi)\to\int\phi\,dP$  $\mathbb{P}$-a.s., for all $\phi\in L_1(P)$,

where $m_n(\phi)=\frac{1}{n}\sum_{i=0}^{n-1}\phi(X_i,Y_i)$.

We need the following result.

Theorem 4.1 [6]. Given a bounded measurable function $\phi:\Omega\to\mathbb{R}$, define
$\phi_0=\phi$, $\mathcal{G}_n=\sigma(X)$ or $\mathcal{G}_n=\sigma(Y)$ as n is even or odd, and
$\phi_n=E(\phi_{n-1}\mid\mathcal{G}_n)$. Then,

$E(\phi(X_n,Y_n)\mid(X_0,Y_0)=\omega)=\phi_{2n}(\omega)$ for all n and P-almost all $\omega$.

The previous Theorem 4.1 is a version of Theorem 4.1 of [6]. In the latter
paper, the authors focus on densities so that P is assumed absolutely
continuous with respect to a product measure. However, such an assumption
can be dropped, as is easily seen from the proof given in [6].

In view of Theorem 4.1 and the Burkholder-Chow result mentioned in
Section 2.1, Gibbs-admissibility and $\mathcal{D}=\mathcal{N}$ look like very close conditions.
In fact, they are exactly the same thing.

Theorem 4.2.

P is Gibbs-admissible $\Leftrightarrow$ $\mathcal{D}=\mathcal{N}$ $\Leftrightarrow$ condition (6) holds.

Proof. The equivalence between (6) and $\mathcal{D}=\mathcal{N}$ has already been proved
in Corollary 3.5.



Suppose that P is Gibbs-admissible. In order to check (6), fix $U\in\mathcal{U}$ and
$V\in\mathcal{V}$ with $P(U\times V)=P(U^c\times V^c)=0$. Then, $P(U\times V^c)>0$ or
$P(U^c\times V)>0$, say $P(U\times V^c)>0$. Let

$M_1=\{y\in V^c:\beta(y)(U^c)>0\}$, $U_1=\{x\in U:\alpha(x)(V)=\alpha(x)(M_1)=0\}$.

Then, $P(U^c\times V^c)=0$ yields $P(Y\in M_1)=0$, and $P(Y\in M_1)=0$ together
with $P(U\times V)=0$ imply $P(X\in U-U_1)=0$. By induction, for each $j\ge 2$, define

$M_j=\{y\in V^c:\beta(y)(U_{j-1}^c)>0\}$, $U_j=\{x\in U_{j-1}:\alpha(x)(M_j)=0\}$

and verify that $P(Y\in M_j)=0=P(X\in U_{j-1}-U_j)$. Define further
$U_\infty=\bigcap_j U_j$ and note that $P(X\in U_\infty)=P(X\in U)$. Fix
$\omega=(x,y)\in U_\infty\times V^c$. Given j, since $\alpha(x)(M_{j+1})=0$ and $\beta(b)(U_j)=1$ for
each $b\in V^c-M_{j+1}$, the transition kernel K satisfies

$K(\omega,U_j\times V^c)=\int_{V^c}\beta(b)(U_j)\,\alpha(x)(db)=1$.

Thus, $K(\omega,U_\infty\times V^c)=1$ for all $\omega\in U_\infty\times V^c$ and this implies

$\mathbb{P}_\omega(X_n\in U_\infty,\ Y_n\in V^c\text{ for all }n)=1$ for all $\omega\in U_\infty\times V^c$.

Next, by Gibbs-admissibility of P (with $\phi=I_{U\times V^c}$), there is a set $S\in\mathcal{F}$
with $P(S)=1$ and

$\lim_n m_n(I_{U\times V^c})=P(U\times V^c)$  $\mathbb{P}_\omega$-a.s., for all $\omega\in S$.

Since $P(S\cap(U_\infty\times V^c))=P(U\times V^c)>0$, there is a point
$\omega_0\in S\cap(U_\infty\times V^c)$. For such an $\omega_0$, one obtains

$P(U\times V^c)=\lim_n m_n(I_{U\times V^c})=1$  $\mathbb{P}_{\omega_0}$-a.s.

Therefore $P(Y\in V)=0$, that is, condition (6) holds.

Finally, suppose $\mathcal{D}=\mathcal{N}$. By the ergodic theorem, since $(X_n,Y_n)$ is
stationary under $\mathbb{P}$, for P to be Gibbs-admissible it is enough that $\mathbb{P}$ be
0-1-valued on the shift-invariant sub-σ-field of $\mathcal{F}^\infty$. Let h be a bounded harmonic
function, that is, $h:\Omega\to\mathbb{R}$ is bounded measurable and
$h(\omega)=\int h(t)\,K(\omega,dt)$ for all $\omega\in\Omega$. Because of Theorem 4.1 and h harmonic,

$h(\omega)=E(h(X_n,Y_n)\mid(X_0,Y_0)=\omega)=h_{2n}(\omega)$ for all n and P-almost all $\omega$.

By the Burkholder-Chow result (Section 2.1) and $\mathcal{D}=\mathcal{N}$, one also obtains

$h_{2n}\to E(h\mid\mathcal{D})=\int h\,dP$  P-a.s.

Hence, $h(\omega)=\int h\,dP$ for P-almost all $\omega$. Let $H\in\mathcal{F}^\infty$ be such that
$H=\theta^{-1}H$, where $\theta$ is the shift transformation on $\Omega^\infty$. Then,

$h(\omega)=\mathbb{P}_\omega(H)$

is a bounded harmonic function satisfying $I_H=\lim_n h(X_n,Y_n)$, $\mathbb{P}$-a.s.; see,
for example, Theorem 17.1.3 of [9]. Since $(X_n,Y_n)$ is stationary under $\mathbb{P}$, then
$I_H=h(X_0,Y_0)$, $\mathbb{P}$-a.s. Hence, $P(h=0)=1$ or $P(h=1)=1$, which implies
$\mathbb{P}(H)=\int h\,dP\in\{0,1\}$. This concludes the proof. □

Theorem 4.2 is potentially useful in real problems as well, since it singles 
out those P such that Gibbs sampling makes sense; see Section 2.3. 

The next example is motivated by the fact that, in Gibbs sampling applications,
the available information typically consists of the conditionals $\alpha$ and $\beta$.

Example 4.3. Condition (6) can be stated in terms of the conditional
distribution $\alpha=\{\alpha(x):x\in\mathcal{X}\}$ of Y given X. Let $V\in\mathcal{V}$. If
$P(U\times V)=P(U^c\times V^c)=0$ for some $U\in\mathcal{U}$, then
$\alpha(X)(V)=I_{U^c}(X)\in\{0,1\}$ a.s. Conversely, $\alpha(X)(V)\in\{0,1\}$ a.s. implies
$P(U\times V)=P(U^c\times V^c)=0$ with $U=\{x:\alpha(x)(V)=0\}$. It follows that (6) is
equivalent to

(11) $\alpha(X)(V)\in\{0,1\}$ a.s. $\Rightarrow$ $\alpha(X)(V)=0$ a.s. or $\alpha(X)(V)=1$ a.s.

for all $V\in\mathcal{V}$. Thus, whether or not P is Gibbs-admissible depends only
on the "supports" of the probability laws $\alpha(x)$, $x\in\mathcal{X}$. Condition (11) also
suggests various sufficient criteria. Let

$\mathcal{V}_0=\{V\in\mathcal{V}:0<P(\alpha(X)(V)=1)<1\}$.

Condition (11) trivially holds whenever $V\notin\mathcal{V}_0$. Hence, a first (obvious)
sufficient condition for Gibbs-admissibility of P is

(12) $\alpha(X)(V)>0$ a.s. for each $V\in\mathcal{V}_0$.

A second condition is the following. Let X be a metric space and IA the Borel 
(T-field. Then, P is Gibbs-admissible provided there is a set T £U satisfying: 

(i) P(X G T) = 1 and P(X eU)>0 for all open U C X such that T n 

(ii) for each V G Vo, the map x i— > a(x)(V) is continuous on T; 

(iii) for each V G Vo, there is x 6 T such that < a(x)(V) < 1. 

Fix in fact $V \in \mathcal{V}_0$. Then, (ii)-(iii) imply $\{x \in T : 0 < \alpha(x)(V) < 1\} = T \cap U \ne \emptyset$
for some open $U \subset \mathcal{X}$. Thus, $P(0 < \alpha(X)(V) < 1) = P(X \in U) > 0$ by (i).
Therefore, condition (11) holds. Note that, in view of (ii), condition (iii) is
certainly true if $T$ is connected. Similarly, (iii) holds if, for each $V \in \mathcal{V}_0$, the
map $x \mapsto \alpha(x)(V)$ is not constant on some connected component of $T$.
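
When $\mathcal{X}$ and $\mathcal{Y}$ are finite, condition (11) can be checked directly by enumerating every subset $V$ of $\mathcal{Y}$. The following Python sketch does this with exact rational arithmetic. The function names and the dictionary encoding of $P$ are illustrative assumptions, not part of the paper.

```python
from fractions import Fraction
from itertools import combinations


def conditional_y_given_x(joint):
    """Return {x: {y: alpha(x)({y})}} for each x with P(X = x) > 0.

    `joint` maps pairs (x, y) to probabilities (an assumed encoding of P).
    """
    alpha = {}
    for x in {x for x, _ in joint}:
        row = {y: Fraction(p) for (a, y), p in joint.items() if a == x and p > 0}
        mass = sum(row.values())
        if mass > 0:
            alpha[x] = {y: p / mass for y, p in row.items()}
    return alpha


def satisfies_condition_11(joint):
    """Brute-force check of condition (11) over every subset V of the y-values."""
    alpha = conditional_y_given_x(joint)
    ys = sorted({y for _, y in joint})
    for size in range(len(ys) + 1):
        for subset in combinations(ys, size):
            # Values taken by alpha(X)(V) on the support of X.
            values = {
                sum((q for y, q in row.items() if y in subset), Fraction(0))
                for row in alpha.values()
            }
            # (11) fails exactly when alpha(X)(V) is {0,1}-valued but takes both values.
            if values == {0, 1}:
                return False
    return True


third = Fraction(1, 3)
connected = {(0, 0): third, (0, 1): third, (1, 1): third}
split = {(0, 0): Fraction(1, 2), (1, 1): Fraction(1, 2)}
print(satisfies_condition_11(connected))  # True
print(satisfies_condition_11(split))      # False
```

In the `split` example the supports of $\alpha(0)$ and $\alpha(1)$ are disjoint, so $V = \{0\}$ violates (11). In the `connected` example no subset $V$ does.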

In applications, it is useful that $(X_n, Y_n)$ is ergodic on some known set
$S \in \mathcal{F}$. By ergodicity on $S$, we mean $S \in \mathcal{F}$ and

$$P(S) = 1, \quad K(\omega, S) = 1 \quad\text{and}\quad \|K^n(\omega, \cdot) - P\| \to 0 \quad\text{for all } \omega \in S,$$

where $\|\cdot\|$ is the total variation norm and $K^n$ the $n$th iterate of $K$. If $(X_n, Y_n)$
is ergodic on $S$, for each $\omega \in S$ one obtains

$$m_n(\phi) \to \int \phi\,dP, \quad \mathbb{P}_\omega\text{-a.s., for all } \phi \in L_1(P).$$

Thus, ergodicity on some $S$ implies Gibbs-admissibility of $P$. We now seek
conditions for the converse to be true.
To this end, an intriguing choice of $S$ is

$$S_0 = \{\omega \in \Omega : K(\omega, \cdot) \ll P\}.$$

A simple condition for $S_0 \in \mathcal{F}$ is that $\mathcal{F}$ be countably generated.
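
For a finite state space, the kernel $K$ and the distance $\|K^n(\omega, \cdot) - P\|$ can be computed exactly. The sketch below assumes that $K(\omega, \cdot)$, for $\omega = (x, y)$, is obtained by drawing $b$ from $\alpha(x)$ and then $a$ from $\beta(b)$, and it takes the total variation norm to be half the $\ell_1$ distance (one common convention, assumed here). The helper names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def gibbs_kernel(joint):
    """Return (states, p, K) for the two-component Gibbs chain of `joint`.

    `joint` maps (x, y) to positive probabilities; zero cells may be omitted.
    K[(x, y)][(a, b)] = alpha(x)({b}) * beta(b)({a}), which depends on x only.
    """
    xs = sorted({x for x, _ in joint})
    ys = sorted({y for _, y in joint})
    states = list(product(xs, ys))
    p = {s: Fraction(joint.get(s, 0)) for s in states}
    px = {x: sum(p[x, y] for y in ys) for x in xs}
    py = {y: sum(p[x, y] for x in xs) for y in ys}
    # Assumption: every listed x and y has positive marginal probability.
    assert all(px.values()) and all(py.values())
    K = {
        (x, y): {(a, b): (p[x, b] / px[x]) * (p[a, b] / py[b]) for a, b in states}
        for x, y in states
    }
    return states, p, K


def tv_after(joint, start, n):
    """Half the l1 distance between K^n(start, .) and P."""
    states, p, K = gibbs_kernel(joint)
    dist = {s: Fraction(int(s == start)) for s in states}
    for _ in range(n):
        dist = {t: sum(dist[s] * K[s][t] for s in states) for t in states}
    return sum(abs(dist[s] - p[s]) for s in states) / 2


third = Fraction(1, 3)
connected = {(0, 0): third, (0, 1): third, (1, 1): third}
split = {(0, 0): Fraction(1, 2), (1, 1): Fraction(1, 2)}
print(tv_after(connected, (0, 0), 1))   # 1/6
print(tv_after(split, (0, 0), 20))      # 1/2
```

For `connected` the distance shrinks toward zero as $n$ grows. For `split` the chain started at $(0, 0)$ never leaves that state, so the distance stays at $1/2$.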

Theorem 4.4. If $\mathcal{F}$ is countably generated, condition (6) holds and
$P(S_0) = 1$, then $(X_n, Y_n)$ is ergodic on $S_0$.
Proof. Since $P(S_0) = 1$, the definition of $S_0$ gives $K(\omega, S_0) = 1$ for all
$\omega \in S_0$. Let $P_0$ and $K_0(\omega, \cdot)$ be the restrictions of $P$ and $K(\omega, \cdot)$ to $\mathcal{F}_0$,
where $\omega \in S_0$ and $\mathcal{F}_0 = \{F \cap S_0 : F \in \mathcal{F}\}$. Then, $(X_n, Y_n)$ can be seen as a
Markov chain with state space $(S_0, \mathcal{F}_0)$, transition kernel $K_0$ and stationary
distribution $P_0$. Also,

$$\|K^n(\omega, \cdot) - P\| = \|K_0^n(\omega, \cdot) - P_0\| \quad\text{for all } \omega \in S_0.$$

By standard arguments on Markov chains, it is thus enough to prove that
$K_0$ is aperiodic and every bounded harmonic function (with respect to $K_0$) is
constant on $S_0$. Let $h_0$ be one such function, that is, $h_0 : S_0 \to \mathbb{R}$ is bounded
measurable and $h_0(\omega) = \int h_0(t)\,K_0(\omega, dt)$ for all $\omega \in S_0$. Define $h = h_0$ on
$S_0$ and $h = 0$ on $S_0^c$. Then, $h : \Omega \to \mathbb{R}$ is bounded measurable and $h(\omega) =
\int h(t)\,K(\omega, dt)$ for $P$-almost all $\omega$. Letting $A = \{\omega \in \Omega : h(\omega) = \int h\,dP\}$ and
arguing as in the proof of Theorem 4.2, condition (6) implies $P(A) = 1$.
Thus, $K(\omega, A) = 1$ for each $\omega \in S_0$, so that

$$h_0(\omega) = \int h_0(t)\,K_0(\omega, dt) = \int h(t)\,K(\omega, dt) = \int h\,dP \quad\text{for all } \omega \in S_0.$$

It remains to prove aperiodicity of $K_0$. Toward a contradiction, suppose
there are $d \ge 2$ nonempty disjoint sets $F_1, \ldots, F_d \in \mathcal{F}_0$ such that

$$K_0(\omega, F_{i+1}) = 1 \quad\text{for all } \omega \in F_i \text{ and } i = 1, \ldots, d, \text{ where } F_{d+1} = F_1.$$

If $P(F_1) = 1$, then $K_0(\omega, F_1) = 1$ for all $\omega \in S_0$, contrary to $K_0(\omega, F_1) = 0$
for $\omega \in F_1$. Hence, $P(F_1) < 1$. Applying Theorem 4.1 to $\phi = I_{F_1}$, one obtains

$$K^{nd}(\omega, F_1) = \phi_{2nd}(\omega) \quad\text{for all } n \text{ and } P\text{-almost all } \omega.$$

Hence, the Burkholder-Chow result (Section 2.1) and condition (6) yield

$$K^{nd}(\cdot, F_1) = \phi_{2nd} \to E(\phi \mid \mathcal{D}) = \int \phi\,dP = P(F_1) \quad\text{a.s.}$$

Since $\lim_n K^{nd}(\omega, F_1) = 1 \ne P(F_1)$ for all $\omega \in F_1$, it follows that $P(F_1) = 0$.
But this is a contradiction, since $P(F_1) = 0$ implies $K_0(\omega, F_1) = 0$ for all
$\omega \in S_0$. Thus, $K_0$ is aperiodic. □

By Theorem 4.4, Gibbs-admissibility implies ergodicity on $S_0$ whenever
$P(S_0) = 1$ (and $\mathcal{F}$ is countably generated). In turn, for $P(S_0) = 1$, it is
enough that $P \ll \mu \times \nu$.

Theorem 4.5. If $\mathcal{F}$ is countably generated and $P \ll \mu \times \nu$, then $(X_n, Y_n)$
is ergodic on $S_0$ if and only if condition (6) holds, that is, if and only if $P$
is Gibbs-admissible.
Proof. It suffices to prove $P(S_0) = 1$. Let $f$ be a version of the density
of $P$ with respect to $\mu \times \nu$ and

$$f_1(x) = \int f(x, y)\,\nu(dy), \qquad f_2(y) = \int f(x, y)\,\mu(dx).$$

Define $D_1 = \{x : 0 < f_1(x) < \infty\}$, $D_2 = \{y : 0 < f_2(y) < \infty\}$ and

$$D = \{x \in D_1 : \alpha(x)(D_2) = 1\}.$$

Since $P(X \in D_1) = P(Y \in D_2) = 1$, then $P(X \in D) = 1$. Fix $\omega = (x, y) \in
D \times \mathcal{Y}$. Then, $\alpha(x)$ has density $f(x, \cdot)/f_1(x)$ with respect to $\nu$. Also, $\beta(b)$
has density $f(\cdot, b)/f_2(b)$ with respect to $\mu$ for each $b \in D_2$, and $\alpha(x)(D_2) = 1$.
Hence, for $C \in \mathcal{F}$,



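$$\begin{aligned}
K(\omega, C) &= \int\!\!\int I_C(a, b)\,\beta(b)(da)\,\alpha(x)(db) \\
&= \int\!\!\int I_C(a, b)\,\frac{f(a, b)}{f_2(b)}\,\frac{f(x, b)}{f_1(x)}\,\mu(da)\,\nu(db) \\
&= \frac{1}{f_1(x)} \int\!\!\int I_C(a, b)\,\frac{f(x, b)}{f_2(b)}\,f(a, b)\,\mu(da)\,\nu(db) \\
&= \frac{1}{f_1(x)} \int I_C(a, b)\,\frac{f(x, b)}{f_2(b)}\,P(d(a, b)).
\end{aligned}$$

Therefore, $K(\omega, \cdot) \ll P$, so that $P(S_0) \ge P(D \times \mathcal{Y}) = 1$. □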



Example 4.3 (Continued). Suppose $\mathcal{F}$ is countably generated and $P \ll
\mu \times \nu$. By Theorem 4.5, $(X_n, Y_n)$ is ergodic on $S_0$ if and only if $\alpha$ meets
condition (11). Let $f$ and $f_1$ be as in the proof of Theorem 4.5 and

$$I_x = \{y : f(x, y) > 0\}.$$

Then, condition (11) holds provided

$$\nu(I_x \cap I_z) > 0 \quad\text{whenever } 0 < f_1(x),\, f_1(z) < \infty.$$

Suppose in fact $\alpha(x)(V) = 1$ for some $x$ with $0 < f_1(x) < \infty$ and $V \in \mathcal{V}_0$. For
every $z$ satisfying $0 < f_1(z) < \infty$, one obtains $\alpha(z)(I_x) > 0$ [since $\nu(I_x \cap I_z) >
0$] and this implies $\alpha(z)(V) > 0$. Hence, condition (12) holds. Next, suppose
$\mathcal{X}$ is a metric space, $\mathcal{U}$ the Borel $\sigma$-field and $\mu$ has full topological support.
Then, another sufficient condition for (11) is:

(j) $f(x, y) \le h(y)$ for all $(x, y)$ and some $\nu$-integrable function $h$;

(jj) $x \mapsto f(x, y)$ is continuous for fixed $y \in \mathcal{Y}$;

(jjj) for each $V \in \mathcal{V}_0$, there is $x \in \mathcal{X}$ with $\nu(I_x \cap V) > 0$ and $\nu(I_x \cap V^c) > 0$.
Under (j)-(jj), in fact, $f_1$ is a real continuous function and $x \mapsto \alpha(x)(V)$ is
continuous on the set $\{f_1 > 0\}$ for all $V \in \mathcal{V}_0$. Also, since $\{f_1 > 0\}$ is open
and $\mu$ has full topological support, $P(X \in U) > 0$ whenever $U$ is open and
$U \cap \{f_1 > 0\} \ne \emptyset$. Hence, conditions (i)-(iii) (mentioned in the first part
of this example) are satisfied with $T = \{f_1 > 0\}$. Recall that (jjj) holds if
$\{f_1 > 0\}$ is connected (as well as in some other situations).
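
On a finite space, with $\nu$ taken to be counting measure (an assumption made only for this illustration), the overlap condition $\nu(I_x \cap I_z) > 0$ reduces to every pair of rows of the probability table sharing at least one positive column. A minimal Python sketch, with a hypothetical function name and dictionary encoding of $P$:

```python
from itertools import combinations


def supports_overlap(joint):
    """Check nu(I_x & I_z) > 0 for all x, z with f_1 > 0, nu = counting measure.

    `joint` maps (x, y) to probabilities; I_x is the set of y with joint[(x, y)] > 0.
    """
    support = {}
    for (x, y), p in joint.items():
        if p > 0:
            support.setdefault(x, set()).add(y)
    # Only x with a nonempty support (f_1(x) > 0) appear as keys.
    return all(support[x] & support[z] for x, z in combinations(support, 2))


connected = {(0, 0): 1 / 3, (0, 1): 1 / 3, (1, 1): 1 / 3}
split = {(0, 0): 0.5, (1, 1): 0.5}
print(supports_overlap(connected))  # True
print(supports_overlap(split))      # False
```

The check is only a sufficient criterion for (11), as stated in the example above.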

We close the paper with two remarks. 

Remark 4.6 (Uniform and geometric ergodicity). Let $f$, $f_1$, $f_2$ and $D$
be as in the proof of Theorem 4.5, where $\mathcal{F}$ is countably generated and
$P \ll \mu \times \nu$. Suppose that

$$s\,I_{U \times V} \le f \quad\text{and}\quad f_1 \le t\,I_U$$

for some constants $s, t > 0$ and $U \in \mathcal{U}$, $V \in \mathcal{V}$ with $P(U \times V) > 0$.

Then, $(X_n, Y_n)$ is ergodic on $S_0$. In addition, $(X_n, Y_n)$ is uniformly ergodic
on $D \times \mathcal{Y}$, in the sense that

$$\sup_{\omega \in D \times \mathcal{Y}} \|K^n(\omega, \cdot) - P\| \le q\,r^n$$

for some constants $q > 0$ and $r < 1$. Also, the convergence rate $r$ can be
taken such that $r \le 1 - (s/t)\,\nu(V)$.

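In fact, $f_1 = 0$ on $U^c$ implies $f = 0$ on $U^c \times \mathcal{Y}$, $\mu \times \nu$-a.e. Thus, (6) holds
by Corollaries 3.5 and 3.7, and $(X_n, Y_n)$ is ergodic on $S_0$ by Theorem 4.5.
Since $\mu(U) > 0$, $\nu(V) > 0$ and

$$s\,\mu(U)\,\nu(V) \le \int_{U \times V} f\,d(\mu \times \nu) = P(U \times V),$$

then $0 < \nu(V) < \infty$, and one can define the probability measure

$$\psi(C) = \frac{1}{\nu(V)} \int I_C(a, b)\,I_V(b)\,\frac{1}{f_2(b)}\,P(d(a, b)), \quad C \in \mathcal{F}.$$

Since $f_1 = 0$ on $U^c$, then $D \subset \{f_1 > 0\} \subset U$. Therefore, for each $\omega = (x, y) \in
D \times \mathcal{Y}$, one obtains

$$\begin{aligned}
K(\omega, C) &= \frac{1}{f_1(x)} \int I_C(a, b)\,\frac{f(x, b)}{f_2(b)}\,P(d(a, b)) \\
&\ge \frac{s}{t} \int I_C(a, b)\,I_V(b)\,\frac{1}{f_2(b)}\,P(d(a, b)) = \frac{s}{t}\,\nu(V)\,\psi(C), \quad C \in \mathcal{F}.
\end{aligned}$$

Thus, $D \times \mathcal{Y}$ is a small set such that $P(D \times \mathcal{Y}) = 1$, and this implies uniform
ergodicity of $(X_n, Y_n)$ on $D \times \mathcal{Y}$; see pages 1714-1715 and Proposition 2 of
[10].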
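
The minorization inequality behind Remark 4.6 can be verified numerically on a finite space. The sketch below assumes $\mu = \nu$ = counting measure, so that $f$ is the probability table of $P$ and $\nu(V)$ is the number of elements of $V$. It checks, atom by atom, that $K(\omega, \cdot) \ge (s/t)\,\nu(V)$ times the minorizing probability measure (called `psi` in the code, a name chosen here). The function name and the encoding of $P$ are hypothetical.

```python
from fractions import Fraction
from itertools import product


def check_minorization(joint, U, V, s, t):
    """Return (eps, holds) with eps = (s/t) * nu(V), nu = counting measure."""
    xs = sorted({x for x, _ in joint})
    ys = sorted({y for _, y in joint})
    p = {(x, y): Fraction(joint.get((x, y), 0)) for x, y in product(xs, ys)}
    f1 = {x: sum(p[x, y] for y in ys) for x in xs}
    f2 = {y: sum(p[x, y] for x in xs) for y in ys}
    # Hypotheses of Remark 4.6: s * I_{U x V} <= f and f_1 <= t * I_U.
    assert all(p[x, y] >= s for x in U for y in V)
    assert all(f1[x] <= (t if x in U else 0) for x in xs)
    eps = Fraction(s) / Fraction(t) * len(V)
    holds = True
    for x in xs:  # K(omega, .) depends on omega = (x, y) only through x
        if f1[x] == 0:
            continue
        for a, b in product(xs, ys):
            if f2[b] == 0:
                continue
            k = (p[x, b] / f1[x]) * (p[a, b] / f2[b])          # K(omega, {(a, b)})
            psi = (p[a, b] / f2[b]) / len(V) if b in V else Fraction(0)
            holds = holds and k >= eps * psi
    return eps, holds


third = Fraction(1, 3)
connected = {(0, 0): third, (0, 1): third, (1, 1): third}
eps, holds = check_minorization(connected, U={0, 1}, V={1}, s=third, t=Fraction(2, 3))
print(eps, holds)  # 1/2 True
```

With these choices the bound on the rate in Remark 4.6 reads $r \le 1 - 1/2 = 1/2$.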
The previous assumptions can be adapted to obtain geometric ergodicity,
in the sense that $(X_n, Y_n)$ is ergodic on $S_0$ and $\|K^n(\omega, \cdot) - P\| \le q(\omega)\,r^n$ for
$P$-almost all $\omega$, where $r \in (0, 1)$ is a constant and $q$ a function in $L_1(P)$. As an
example (we omit calculations), $(X_n, Y_n)$ is geometrically ergodic whenever

$$f \text{ and } f_1 \text{ are bounded}, \quad f \ge s \text{ on } U \times V, \quad f = 0 \text{ on } U^c \times V^c,$$

for some $s > 0$, $U \in \mathcal{U}$, $V \in \mathcal{V}$ such that



Note that, for the above conditions to apply, $\mu$ must be a finite measure.
Although lengthy to state, such conditions can be useful. They apply, for
instance, when $(\Omega, \mathcal{F})$ is the Borel unit square, $\mu = \nu$ = one-dimensional
Lebesgue measure, and $P$ is uniform on the lower or upper half triangle.

Remark 4.7 (The $k$-component case). This paper has been conceived
and written for the two-component case, but its contents extend to the $k$-component
case, with $k > 2$ any integer. In particular, Theorems 4.2, 4.4 and
4.5 can be adapted to the $k$-component Gibbs sampler. We just mention that,
in general, the involved sub-$\sigma$-fields are $\mathcal{A}_i = \sigma(Z_1, \ldots, Z_{i-1}, Z_{i+1}, \ldots, Z_k)$,
$i = 1, \ldots, k$, where $Z_i$ denotes the $i$th coordinate projection on the product
of some $k$ measurable spaces.